Learning the opportunity cost of time in a patch-foraging task.

Authors

  • Sara M Constantino
  • Nathaniel D Daw
Abstract

Although most decision research concerns choice between simultaneously presented options, in many situations options are encountered serially, and the decision is whether to exploit an option or search for a better one. Such problems have a rich history in animal foraging, but we know little about the psychological processes involved. In particular, it is unknown whether learning in these problems is supported by the well-studied neurocomputational mechanisms involved in more conventional tasks. We investigated how humans learn in a foraging task, which requires deciding whether to harvest a depleting resource or switch to a replenished one. The optimal choice (given by the marginal value theorem; MVT) requires comparing the immediate return from harvesting to the opportunity cost of time, which is given by the long-run average reward. In two experiments, we varied opportunity cost across blocks, and subjects adjusted their behavior to blockwise changes in environmental characteristics. We examined how subjects learned their choice strategies by comparing choice adjustments to a learning rule suggested by the MVT (in which the opportunity cost threshold is estimated as an average over previous rewards) and to the predominant incremental-learning theory in neuroscience, temporal-difference learning (TD). Trial-by-trial decisions were explained better by the MVT threshold-learning rule. These findings expand on the foraging literature, which has focused on steady-state behavior, by elucidating a computational mechanism for learning in switching tasks that is distinct from those used in traditional tasks, and suggest connections to research on average reward rates in other domains of neuroscience.
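The comparison the abstract describes, immediate return versus an opportunity cost estimated as an average over previous rewards, can be illustrated with a minimal sketch. This is not the authors' fitted model: the function name `simulate_mvt_learner`, the deterministic multiplicative depletion, the delta-rule running average, and every parameter value below are assumptions chosen only for illustration.

```python
def simulate_mvt_learner(n_decisions=2000, start_reward=10.0, depletion=0.9,
                         travel_time=5, alpha=0.05):
    """Hypothetical patch-foraging learner (illustrative assumptions only).

    Each harvest takes one time step. The opportunity cost of time, rho,
    is a running average of reward per time step, updated with learning
    rate alpha. Returns (final rho, number of patch exits).
    """
    rho = 0.0                      # estimated long-run average reward per step
    next_reward = start_reward     # reward expected from the next harvest
    exits = 0
    for _ in range(n_decisions):
        if next_reward < rho:
            # Leave: expected harvest is below the opportunity cost of one step.
            # Travel yields zero reward for travel_time steps, which is the same
            # as applying the delta rule with reward 0 that many times.
            rho *= (1.0 - alpha) ** travel_time
            next_reward = start_reward   # arrive at a replenished patch
            exits += 1
        else:
            # Stay: harvest, update the average, and let the patch deplete.
            reward = next_reward
            rho += alpha * (reward - rho)
            next_reward = depletion * reward
    return rho, exits


if __name__ == "__main__":
    for t in (5, 20):
        rho, exits = simulate_mvt_learner(travel_time=t)
        print(f"travel_time={t}: rho={rho:.2f}, exits={exits}")
```

Under these assumptions, a longer travel time lowers the learned threshold, so the simulated forager stays longer in each patch and leaves fewer times over the same number of decisions, which is the qualitative direction of the blockwise adjustment described above.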

Related resources

A Neural Mechanism for the Opportunity Cost of Time

Recent interest has focused on a class of decision problems in which subjects encounter options serially and must decide when to leave an option in search of a better one, rather than directly comparing simultaneously presented options. Although such problems have a rich history in animal foraging and economics, relatively little is known about their neural substrates. Suggestively,...

Exploration or exploitation: life expectancy changes the value of learning in foraging strategies

The acquisition of information is a fundamental part of individual foraging behaviour in heterogeneous and changing environments. We examine how foragers may benefit from utilizing a simple learning rule to update estimates of temporal changes in resource levels. In the model, initial expectation of resource conditions and rate of replacing past information by new experiences are genetically in...

Does information sharing promote group foraging?

Individuals may join groups for several reasons, one of which is the possibility of sharing information about the quality of a foraging area. Sharing information in a patch-foraging scenario gives each group member an opportunity to make a more accurate estimate of the quality of the patch. In this paper we present a mathematical model in which we study the effect of group size on patch-leaving...

Cycle Time Optimization of Processes Using an Entropy-Based Learning for Task Allocation

Cycle time optimization could be one of the great challenges in business process management. Although there is much research on this subject, task similarities have been paid little attention. In this paper, a new approach is proposed to optimize cycle time by minimizing entropy of work lists in resource allocation while keeping workloads balanced. The idea of the entropy of work lists comes fr...

Implicit Motor Learning after Unilateral Stroke Using Serial Reaction Time Task

Introduction: Motor skills and learning after stroke are of great importance. This study aimed to examine implicit learning in unilateral stroke patients using the affected hand, in comparison with normal subjects. Methods: A software-based serial reaction time task was used to study implicit motor learning in 15 stroke patients and 15 matched normal subjects. In this task 4 squar...

Journal title:
  • Cognitive, affective & behavioral neuroscience

Volume 15, Issue 4

Pages  -

Publication date: 2015